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Abstract — In this paper, a new approach for decoding low- 
rate Reed-Solomon codes beyond half the minimum distance is 
considered and analyzed. Unlike the Sudan algorithm published 
in 1997, this new approach is based on multi-sequence shift- 
register synthesis, which makes it easy to understand and simple 
to implement. The computational complexity of this shift-register 
based algorithm is of the same order as the complexity of the 
well-known Berlekamp-Massey algorithm. Moreover, the error 
correcting radius coincides with the error correcting radius 
of the original Sudan algorithm, and the practical decoding 
performance observed on a g-ary symmetric channel (QSC) is 
virtually identical to the decoding performance of the Sudan 
algorithm. Bounds for the failure and error probability as well 
as for the QSC decoding performance of the new algorithm are 
derived, and the performance is illustrated by means of examples. 

Index Terms — Reed-Solomon codes, decoding beyond half 
the minimum distance, Interleaved Reed-Solomon codes, multi- 
sequence shift-register synthesis, 

I. Introduction 

Reed-Solomon codes are usually decoded algebraically 
by Bounded Minimum Distance (BMD) decoders which are 
capable of decoding all error patterns up to half the minimum 
distance, and no error patterns beyond this limit. Such classical 
BMD decoding approaches are mostly syndrome based, i.e., 
they work by creating a syndrome from the received word, 
which allows to uniquely reconstruct the error pattern, pro- 
vided that its weight is not larger than half the minimum 
distance of the code. There have been several straight forward 
approaches like [1] and [2] for extending these syndrome 
based techniques for decoding errors beyond half the minimum 
distance using an extensive post processing step. However, for 
complexity reasons the decoding radius can usually only be 
slightly increased using such techniques. 

When the original Sudan algorithm was published in 1997 
[3], it attracted a lot of interest among experts, since it 
allows a Reed-Solomon code of rate R < i to be decoded 
beyond half the minimum distance using algebraic methods. 

The research work of Vladimir R. Sidorenko and Georg Schmidt is sup- 
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More precisely, the Sudan algorithm considers the problem 
of decoding a Reed-Solomon code as bivariate polynomial 
interpolation and polynomial factorization problem, which can 
be solved in polynomial time. 

In [4] we presented a syndrome based decoding method, 
which uses multi-sequence shift-register synthesis to decode 
Reed-Solomon codes of rate R < | beyond half the minimum 
distance. In this paper, we analyze this decoding method 
and demonstrate, that it practically gives the same decoding 
performance as the original Sudan algorithm [3]. 

II. BMD Decoding of Reed-Solomon Codes 

Unlike the Sudan algorithm which is based on polynomial 
interpolation, our new decoding approach aims at utilizing 
classical syndrome based methods. More precisely, we use an 
approach based on multi-sequence shift-register synthesis for 
locating the erroneous positions in the received words. To ease 
understanding of this approach, and to introduce the notation, 
we start with a brief overview over classical syndrome based 
BMD decoding of Reed-Solomon codes based on shift- 
register synthesis. 

Reed-Solomon Codes 

For our purposes, it is most convenient to define Reed- 
Solomon codes in the frequency domain, i.e., by the spectra 
of their codewords. Roughly speaking, a spectrum of a Reed- 
Solomon codeword is "band-limited", i.e., it possesses at most 
k non-zero coefficients and at least n— k cyclically consecutive 
zero coefficients. Terms like "spectrum" or "band-limited" are 
usually related to the Fourier Transform. To use such terms 
in the context of coding theory, we start with defining the 
Discrete Fourier Transform (DFT) over the field ¥ q : 

Definition 1 (Discrete Fourier Transform in ¥ q ) 

Let p(x) = pa + pix + • • • + p n -\x n ~ l be a polynomial of 
degree deg(p(x)) < n — 1 overF g . Further, leta £ ¥ q besome 
element of order n. Then, the polynomial 

P( X ) = & (p(x)) =P +P lX + --- + Pn-iz"" 1 

whose coefficients are calculated by Pj = p(a l ) is called the 
Fourier Transform ofp(x) over the held ¥ q . 
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The inverse discrete Fourier Transform is calculated by 



p(x) = ^ (P(x)) = Po +PlX + ---+ p n _ 
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where P(x) is a polynomial of deg(P(x)) < n — 1, and p t = 
n~ 1 P{a~ l ). In the terminology of the Fourier Transform we 
call p(x) time domain polynomial and P(x) the frequency 
domain polynomial or the spectrum of p(x). 

Definition 1 enables us to describe Reed-Solomon codes 
by means of their codeword spectra. As already mentioned, 
the spectrum of a codeword of a Reed-Solomon code is 
characterized by n — k cyclically consecutive zero coefficients. 
For our purposes, it is not necessary to define Reed-Solomon 
codes in their most general form. For the sake of simplicity, we 
restrict our considerations to codes, whose codeword spectra 
are zero in the n — k rightmost positions. More precisely, we 
define a Reed-Solomon codes as follows: 

Definition 2 (Reed-Solomon code) 

Let 

(k-i -\ 



i=0 j 



{C(x)} 

be the set of all polynomials of deg(C(x)) < k with coeffi- 
cients Ci from F q , and let a £ ¥ q be some element of order n. 
Then, a Reed-Solomon code C = 1ZS (q; n, k) of length n and 
dimension k can be defined as the set of polynomials 



C = {c(x) 



&- x {C{x)) \C(x)e{C(x)}} 



The polynomials c(x) = cq + c\x - 
equivalently the vectors c = (cq, ci, . . 
codewords of the Reed-Solomon code. 



+ c„_ix" \ or 
_i), are called the 



To locate the erroneous positions in y(x), we use a tech- 
nique introduced by Peterson [5] and define a polynomial 
A(x) = A + Xix + • • • + A„_i.t" _1 such that Xj = 
whenever the corresponding coefficient ej of e(x) is non- 
zero, and Xj 7^ 0, whenever Sj = 0. In this way, we have 
Xj ■ ej = V j = 0, . . . , n — 1. Due to the well-known 
convolution theorem of the DFT, this relation is transformed 
into 

A(x) ■ E(x) = mod x n - 1 . (1) 

The t roots a - ^ 1 , . . . , oT' H of the frequency domain poly- 
nomial A(x) correspond to the locations ji,..-,jt of the 
erroneous positions in y(x). Therefore 

A(x) — A + Aix H h A t x* 

is called error locator polynomial. Since we are only interested 
in the roots of A(x) and Ao ^ 0, we are always able to 
normalize A(x) w.l.o.g. such that Ao = 1. 

Equation (1) gives rise to a linear system of n equations. 
From these equations, n — k — t equations only depend on 
the n — k coefficients from E(x), which coincide with the 
elements Sq, . . . , S n -k-x °f the syndrome, and the unknown 
coefficients of the error locator polynomial A(x). Hence, we 
extract a linear system of n — k — t equations and t unknowns. 
With m — n — k, this system of equations can be represented 
by the matrix equation 

St-i 

■ ■ s t 
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Due to the Polynomial Factor Theorem, a frequency poly- 
nomial C(x) has at most k — 1 roots. Hence, due to the 
properties of the DFT, a codeword has at most k — 1 zero 
coefficients, which means, that each codeword has at least 
weight n — k+1. Thus, by the Singleton bound, the minimum 
distance of a Reed-Solomon code pursuant to Definition 2 is 
exactly d = n — k + 1. 

BMD Decoding in the Frequency Domain 

Now, we assume that the codewords of C are transmitted 
over a noisy channel, which adds a random error polynomial 
e(x) = eo + eix + - ■ ■ + e n -xx n ^ 1 with t non-zero coefficients 
form W q . Hence, at the channel output we observe the word 



y(x) = c(x) + e(x) . 

To decode the errors in this word, we calculate a syndrome 
from y(x), which only depends on the error word e(x) 
and not on the codeword c(x). To do so, we utilize the 
spectral properties of a Reed-Solomon code. More precisely, 
we calculate the spectrum Y(x) = (y(x)) = C(x) + E(x), 
which is the sum of the codeword spectrum C(x) = & (c(x)) 
and the error spectrum E(x) = J?(e(x)). Then, we define 



Yi 



k+j 



E k +j,j 



0, ...,n-fc-l. Since deg(C(x)) < 
k, the sequence S — (Sq, Si, ... , S n ^k-i) om y depends on 
the error specturm E(x), and hence S acts as syndrome for 
decoding y(x). 



(2) 

If (2) has a unique solution, it can be used to calculate the 
coefficients of a unique error locator polynomial A(x). In 
this case, the locations of the erroneous symbols are uniquely 
determined. However, a unique solution can only exist, if the 
number of columns in S (number of unknowns) is not larger 
than the number of rows (number of equations), i.e., as long 
as 

n — k 



t < 



Moreover, from the Reed-Solomon code properties we know 
that S has full rank and therefore a unique solution must 
always exists as long as the number of errors is not larger 
than half the minimum distance. Consequently, the BMD error 
correcting radius is specified by 



n — k 




d-1 


2 
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Carefully considering Equation 2 yields that the matrix S 
is a Toeplitz matrix. This allows us to rewrite (2) in the form 
of the linear recursion 



t 



2_^A h Sj-h, j 

h=l 



t + 1. 



,2t. 



(3) 



Since this linear recursion can be realized by a linear feedback 
shift-register where the (unknown) coefficients of A(x) are 
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the feedback connection weights, the error locator polyno- 
mial A(x) can be determined by synthesizing the shortest 
possible linear feedback shift-register capable of generating 
S. This task is performed very efficiently by the well-known 
Berlekamp-Massey algorithm [6], [7]. If y(x) is erroneous in 
t positions, then the syndrome sequence has linear complexity 
t due to Blahut's Theorem [1], [8]. Hence, the Berlekamp- 
Massey algorithm will synthesize a shift-register of length t. 
As long as t < \J 1 y^\, (2) has only one solution, and conse- 
quently the Berlekamp-Massey algorithm always synthesizes 
a valid A-polynomial. 

After a valid error locator polynomial A (a;) is determined, 
its roots a _J1 , . . . , oT^ give rise to the positions j\, . . . , j t 
of the erroneous symbols in y(x). To evaluate the errors at 
these positions, i.e., to determine the values of the symbols 
ej 1 , . . . , e Jt we may use standard methods like the Forney 
Algorithm [9] or Recursive Extension [10]. 

III. Decoding Beyond half the Minimum Distance 

Clearly, the BMD decoding strategy described in the pre- 
vious section uses a spherical decoding region. To decode a 
code C C F™ of length n and minimum Hamming distance d, a 
BMD decoder surrounds each codeword c G C by a sphere of 
radius i max < r = as sketched in Fig. la. All vectors 

y within this sphere are decoded into c. Since t max < r, the 
decoding spheres are not overlapping, and consequently each 
vector y G F™ is included either in exactly one sphere, or in 
no sphere at all. Hence, all received vectors y within a sphere 
around c are uniquely decoded into c, and all vectors outside 
the spheres are not decodable. Thus, a BMD decoder either 
yields a unique decoding result (either correct or erroneous), 
or a decoding failure, i.e., no result at all. 

In contrast to this, a Maximum Likelihood (ML) decoder is 
capable of decoding errors beyond half the minimum distance. 
For this purpose, it covers the complete space F™ by irregular 
shaped Voronoi cells as depicted in Fig. lb. Roughly speaking, 
a Voronoi cell around a codeword c G C contains all vectors 
y G F^ 1 whose distance to c is smaller than the distance to 
any other codeword in C. Hence, an ML decoder covers the 
complete space F™, and consequently never yields a decoding 
failure. However, the complex shapes of the Voronoi cells 
are difficult to handle by algebraic methods, which means 
that known ML decoders generally possess a computational 
complexity, which grows exponentially in n. 

To decode errors beyond half the minimum distance by 
algebraic methods, a common strategy is to stick to spherical 
decoding regions, but increase the maximum decoding radius 
imax beyond half the minimum code distance. Such a strategy 
is commonly called Bounded Distance (BD) decoder. As 
illustrated in Fig. lc, it is inevitable for BD decoders that some 
of the decoding spheres are overlapping. Consequently, the 
received words y e FJ, which are included in more than one 
correcting sphere, cannot be uniquely decoded into a single 
codeword, since the codewords corresponding to all correcting 
spheres which contain the vector y are valid decoding results 
for the received word y. In such situations, basically three 
strategies are conceivable: 



1) The decoder selects one single codeword out of all valid 
decoding results 

2) The decoder returns a list of all valid decoding results, 
that is, a list of all codewords which lie inside a sphere 
of radius t max around the received word y. 

3) The decoder yields a decoding failure, i.e., it yields no 
solution at all 




(a) BMD decoding (b) ML decoding (c) BD decoding 



Fig. 1. Decoding regions for different decoding strategies. 

The Sudan algorithm and its derivatives like [11], [12], etc., 
apply the second strategy and yield a list of codewords if the 
decoding result is not unique. If a unique solution is required, 
one candidate of this list has to be selected (usually the one 
with the smallest distance to the received word y). In this case, 
the first strategy is applied. The third strategy is e.g. applied 
by [13], where the decoder may yield a decoding failure if 
multiple solutions exist. 

If the volume of the overlapping decoding regions are 
small enough, the probability for a unique decoding result is 
quite close to one. In this case, all three decoding strategies 
practically result in a similar decoding performance, since 
there are only a very few cases in which the different strategies 
yield different decoding results. 

We now describe a new decoding method with also applies 
the third strategy. We later demonstrate, that this approach 
practically yields the same QSC decoding performance as the 
classical Sudan algorithm [3]. 

A. Virtual Extension to an Interleaved Reed-Solomon Code 

Interleaved Reed-Solomon (IRS) codes like considered in 
[14], [15], [16], [17], [18], [19], [20], and others are capable of 
correcting error patterns beyond half the minimum distance. 
Usually, IRS code constructions are homogeneous, i.e., they 
consist of I codewords of one single Reed-Solomon code. 
However, as suggested in [15] and investigated in [20], we 
also can construct heterogeneous IRS codes by taking the I 
codewords from different Reed-Solomon codes with the same 
length but different dimensions. As shown in [20], decoding 
of both homogeneous and heterogeneous IRS codes can be 
performed very efficiently using multi-sequence shift-register 
synthesis. 

Like proposed in [4], to decode a single Reed-Solomon 
code beyond half the minimum distance, we can virtually 
extend the code such that it forms a heterogeneous IRS code as 
described in [20]. To do this, we proceed as follows: Consider 
the polynomial 

p{x) =p + pix H h p„_ia; ,l_1 
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over ¥ q and denote by p^ (x) the polynomial 

Consider the Reed-Solomon Code C = 1ZS (q; n, k), and 
define the codes 



C 



<!> = <j>>(x) :c(x) eC},! = l,...,l 



(4) 



Lemma 1 Let C = 1ZS (q; n, k) be a Reed-Solomon code. If 
i(k — 1) + 1 < n, then each codeword from is a codeword 
of the Reed-Solomon code 

C {i} = KS(q;n,i(k-l) + l) , 

of length n and dimension i(k — 1) + 1, or in other words, 
C<i> c C«. 

Proof: Every codeword (x) G corresponds to 
a codeword c(x) G C, where c(x) = (C(x)), and 

deg(C(a;)) < k — 1. Due to the properties of the discrete 
Fourier Transform, the spectrum of c^(x) is calculated by 
& (c^O)) = C\x), where deg(C l (x)) < i(k - 1). Hence, 
by by Definition 2, the polynomial 



J?-" 1 (C'On)) 



is a codeword of a Reed-Solomon code of length n and 
dimension — 1) + 1. ■ 

To transform a codeword c(x) = cq + c±x + ■ ■ ■ + c n -ix n ^ 1 
of a Reed-Solomon code C = 1ZS (q; n, k) into a codeword 
of a heterogeneous IRS code, we select a positive integer I, 
such that 

l(k - 1) + 1 < n 



(5) 



is satisfied. Then, we create the words 

c i = (eg, c\, . . . , c^.-jj , i 
According to Lemma 1, we have 

eC (?) , i = l,.. 
Hence, the I x n matrix 







fc 
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■ C n -i\ 
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c[ . 





consisting of the row vectors , c^> , . . . , c'" can be consid- 
ered as codeword of the heterogeneous IRS code C composed 
from the / Reed-Solomon codes C^\C (2 \ . . . ,C® . 

Now, we assume that we transmit c(x) G C over a channel, 
and observe the corrupted word y(x) = c(x) + e(x) at the 
channel output. To reconstruct the transmitted codeword, we 
do not want to decode y(x) with respect to C, but rather 
process y(x) in such a way that we are able to perform 
decoding with respect to the heterogeneous IRS code C. For 
this purpose, we calculate the I polynomials 



With the coefficients of these polynomials, we create the 
matrix 



Y = 



2} 



Wo 



Vl 



v\ 



Vn-l 
II 2 

Vn-l 



(6) 



Roughly speaking, the matrix Y can be considered as "re- 
ceived" matrix of an IRS code. More precisely, Y possesses 
the following properties: 

Lemma 2 Let c(x) e C be a codeword of a Reed-Solomon 
code C according to Definition 2, transmitted over a noisy 
channel. Assume that the word y(x) = c(x) + e(x) is received, 
and let Y be an I x n matrix composed from the coefficients 
of y(x) pursuant to (6). Then, if e(x) = 0, the matrix Y is 
a codeword of a heterogeneous IRS code composed of the I 
Reed-Solomon codes 

C w = IIS (q; n, i(k - 1) + 1) , i = 1, . . . , I. 

Moreover, if e(x) is a polynomial with t non-zero coeffi- 
cients ejj, ej 2 , . . . , ej t , then the matrix Y is a corrupted IRS 
codeword with exactly t erroneous columns at the positions 
3i,j2, ■ ■ -,3t- 

Proof: If e(x) = 0, then y(x) G C, and by Lemma 1 
and the explanations given above, we know that Y is a 
codeword of the heterogeneous IRS code C. If e(x) is a non- 
zero polynomial, the coefficients of the polynomial 

y {t) (x) = y l + vl* + ■ ■ ■ + Vn-i^ 1 

may be represented by 

Clearly, if ej = 0, that is, if j ^ {ji,j2, ■ ■ ■ >jt}> then 
= Vi = 1, . . . , I. On the other hand, if e 3 ^ 0, that 

is, if j G {ji,j2i ■ ■ ■ ,jt}, then may be non-zero. Hence, 
the first symbol, and in most cases also the other symbols of 
the columns j% , , . . . , j t in Y are erroneous, which means 
that there are exactly t corrupted columns at the positions 
ji,32, ■ ■ ■ , jt- ■ 



B. Decoding by Syndrome Extension 

Since due to Lemma 2 the matrix Y can be considered as 
corrupted codeword of a heterogeneous IRS code C, the matrix 
Y may be decoded as IRS code word using the collaborative 
decoding strategies described in [20]. In other words, we are 
able to create the syndromes 



n-i(fc-l)-2 



»] 

i(fe-l) + l' ' 



,Y. 



(7) 



y lS) ix) = y + y\x + --- + y i n _ l x r - 



L 



,1 



by taking the last n — i(k — 1) — 1 coefficients from the 
spectra Y<- 1 ' (x) = (j/W (x)) for i = 1, . . . , I. In this way, we 
obtain I — 1 auxiliary syndromes <SM with decreasing length 
in addition to the literal syndrome = S for the error word 
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e(x). These syndromes are used to form the following system which means that the maximum error correcting radius is 
of equations: 



increased to 



( S[1 \ 



/ A t \ 
A*_i 



/ T [i]\ 



(8) 




where 



rtii—t—1 ^vcii — t 



S 1 ' 



s „ 



( -sf \ 

q[i] 
J t+1 

and m, = n—i(k—l) — l. In principle, A(x) can now be found 
by solving (8) provided that S has full rank. This approach 
can be considered as generalization of the Peterson algorithm. 
Standard methods from linear algebra usually require a com- 
putational complexity of order O (n 3 ) for solving (8). 

IV. Error Correcting Radius and Threshold Rates 

For ordinary IRS codes, the parameter I is a freely se- 
lectable parameter, which determines the number of rows in 
a codeword matrix C G C. For an IRS code obtained by 
syndrome extension, the parameter I first of all has to fulfill 
(5). This means, that the maximum I for syndrome extension 
is somehow limited by the rate R = — of the original Reed- 
Solomon code C. Moreover, to be able to use the Ith auxiliary 
syndrome in the system of equations pursuant to (8) with 
t unknowns, mi > t has to be fulfilled. This restricts the 
maximum number of correctable errors for a given I. Hence, 
the maximum number of correctable errors is determined by 
the choice of I, which in turn is limited by the rate R of the 
Reed-Solomon code C. 

In the following sections, we investigate the relation be- 
tween the code rate R, the maximum error correcting radius 
i max , and the parameter /. 

A. Maximum Error Correcting Radius 

Using only the literal syndrome S = <SW to decode C = 
1ZS (q; n, fc), we are able to decode errors up to half the 
minimum distance, i.e., we obtain the error correcting radius 

''max — 'max 



By using = <SM together with an auxiliary syndrome = 

of length n — 2k + 1, the system of equations (8) may have a 

unique solution as long as 

t < 2n - 3fc - 2t + 1 , 



t 



[2] 



2n - 3fc + 1 
3 



(9) 



Generally, by using the I syndromes SW,^ 2 !, . . . ,5^, (8) 
may have a unique solution, as long as t fulfills 

1(1+1), , 1(1-1) 
t<ln- v Z, k + - lt + 2 ■ 

Consequently, the maximum error correcting radius is deter- 
mined by 



max 



2ln - 1(1 + l)k + 1(1 - 1) 
2(1+1) , 



(10) 



Equation (10) coincides with the error correcting radius of a 
heterogeneous IRS code composed of the I Reed-Solomon 
codes C (1) ,C (2) , . . . ,C (i) as described in [20]. 



B. Threshold Rates 

To understand, how the choice of I is restricted by the rate 
of C, we consider the spectrum Y(x) = & (y(x)) of the 
received word y(x) = c(x) + e(x) and the spectrum of the 
auxiliary polynomial (x). As explained in Section II, each 
codeword c(x) G C has n — k consecutive zero coefficients 
in its spectrum, and therefore there exist n — k consecutive 
coefficients in Y(x) which only depend on e(x). Informally 
speaking, the spectrum of y(x) possesses a "window" of size 
n — k through which we are able to observe n — k coefficients 
of the transformed error E(x) = J^"(e(x)). As depicted 
in Fig. 2a, these coefficients constitute the literal syndrome 
S = S {1] . Now, we consider the spectrum 

Y {2) (x) = Y 2 (x) = (C(x)+E(x)f = C 2 (x)+E [2] (x) . 

The spectral bandwidth of Y 2 (x) is k — 1 elements larger 
than the bandwidth of the original codeword spectrum Y(x). 
Thus, as illustrated in Fig. 2b, the size of the "window" in 
the spectrum Y^ 2 '(x) is reduced by k — 1 coefficients, which 
results in an auxiliary syndrome of length n — 2k + 1. 



time domain 



frequency domain 



y(x) = c(x) + e(x) „ * 777 


observable window — > <S'^ 


(a) Calculating the literal syndro 


n — k 


me S = SM 


j/ 2 >(x) = c (2> (x) + e [2 )(x) ??? ??? | obs. window — > 5 [2] 


n - 2fe + 1 , 



(b) Calculating the auxiliary syndrome 5 1 2 ' 
Fig. 2. Rate restrictions with virtual Syndrome Extension. 

Clearly, if R > 1/2, the "window" in the auxiliary spectrum 
Y^(x) = & (y^(x)) is completely closed, which means 
that we cannot obtain a non-empty auxiliary syndrome S^ 2 K 
Moreover, even if there exists a non-empty syndrome <S^, 
this syndrome can only be used to extend the error correcting 
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radius if its length is greater than I ^V^J 
syndrome extension can only be used if 



- 2k + 1 > 



1. In other words, 



1 



is satisfied. Solving this inequality for R = k/n yields that the 
error correcting radius can only be extended for codes of rate 
R < 1/3. Hence, exactly like the classical Sudan algorithm 
[3], our syndrome extension procedure is only suitable for 
low-rate Reed-Solomon codes. 

Generally, the decreasing lengths of the auxiliary syndromes 
, . . . , <5>W induce a restriction on the maximum rate of a 
code, for which I syndromes effectively can be used. More 
precisely, to apply I > 1 syndromes, the code rate has to be 
small enough such that 



*Lx ] + 2 < n - l(k - 1) - 1 



(ID 



is satisfied. For a fixed parameter I we call the maximum 
rate of code C of length n for which (11) is satisfied the Ith 
threshold rate of C. The other way round, for a given Reed- 
Solomon code of length n and rate R = k/n, the parameter I 
should be chosen to be the largest positive integer for which 
(11) is satisfied in order to maximize the error correcting 
radius. If (11) cannot be satisfied for I > 1, then no auxiliary 
syndrome can be created and the maximum correcting radius 
is limited by half the minimum distance. 

A closed form approximation for the hh threshold rate can 
be obtained by neglecting the floor operation in (10). Inserting 
(10) into (11) and skipping |_-J yields 

2 I 2 



R 



(0 

th 



31 



1(1 + 1) nl(l + l) 1(1 + 1) ' 



(12) 



where RlJ is the rate below which we are always able to 
use the I auxiliary syndromes S^ 2 \ . . . ,S^ l \ and consequently 
obtain the error correcting radius tn] ax - Vice versa, for a given 
code C of length n, dimension k > 1, and R < R^, we are 
able to solve (12) for I, and find that the number of generable 
syndromes is at least 



I = 



^(Rn + 3) 2 + 8(Rn - l)(n - 1) - (Rn + 3) 
2(Rn- 1) 



(13) 

Note, that since we neglect the [ J operation for calculating 
i?[ h \ there might be cases (particularly for small code lengths) 
for which we are able use / syndromes even for code rates 
slightly higher than R®- 

The error correcting radius calculated by (10) and the 
threshold rates specified by (12) largely coincide with the error 
correcting radii and the threshold rates realized by the classical 
Sudan Algorithm [3]. Moreover, there are further relations 
between our syndrome extension algorithm and the Sudan 
algorithm. In [12], Roth and Ruckenstein proposed an algo- 
rithm for efficiently calculating the Q-polynomial required 
by the Sudan algorithm. This algorithm applies an iterative 
procedure, which synthesizes a bivariate polynomial Q*(x,y) 
from a bivariate syndrome S(x,y) = 2~ZL=i S^ h \x)y % . The 
coefficients S^ h \x) of this polynomial are equivalent to the 
syndrome sequences S^ h \ which we use to state (8). However, 



Roth and Ruckenstein use their algorithm for synthesizing a 
Q-polynomial, from which the valid decoding solutions have 
to be extracted by a recursive reconstruction algorithm. In 
contrast to this, we simply use the syndromes SM, . . , 
for determining the error locator polynomial A(x). 

There exists another Sudan based algorithm proposed in 
[13]. This algorithm also does not create a list in ambiguous 
situations, but fails if no unique solution can be obtained. 
Unlike our algorithm, the algorithm from [13] is based on 
a polynomial interpolation approach, which aims in directly 
reconstructing C(x), rather than using a two step strategy, first 
calculating an error locator polynomial and then evaluating the 
erroneous symbols. Moreover, the algorithm in [13] requires a 
matrix inversion, which has a computational complexity in the 
order of O (n 3 ), when performed by Gaussian elimination. 

We now describe a method for synthesizing A(x), which 
is based on multi-sequence shift-register synthesis. Our 
algorithm requires simply I times the complexity of the 
Berlekamp-Massey algorithm, which is known to be not 
larger than O (n 2 ) (see e.g. [21]). Hence, the computational 
complexity of our algorithm is in the order of O (n 2 ). 

V. Efficient Error Location by Multiple 
Shift-Register Synthesis 

For classical BMD decoding as described in Section II, 
the linear system of equations (2) can be reformulated into a 
linear shift-register synthesis problem which can efficiently be 
solved by the Berlekamp-Massey algorithm. As described in 
[20], the same principle can also be applied for decoding IRS 
codes. This means, we can reformulate (8) into the following 
set of I linear recursions: 



Sf =-j2A h Sfl h ,j = t,...,n-k-l, i = l, 



h=l 



(14) 

For homogeneous IRS codes, the I syndrome sequences 
Sj , Sj, . . . , Sj involved in this set of linear recursions 
all have the same length. In this case (14) could be solved by 
the Feng-Tzeng algorithm [22], [23]. However, the syndrome 
extension method explained above generally yields heteroge- 
neous IRS codes. In this case, the lengths of the syndrome 
sequences Sj , Sj 2 \ . . . , Sj are different. As shown in [24], 
[25], the Feng-Tzeng algorithm does not always yield a correct 
solution in this case. Hence, we use the modified multi- 
sequence shift-register synthesis algorithm proposed in [24], 
[25]. A description of this algorithm adapted for our purposes 
is given by Algorithm 1. If we apply this algorithm to I syn- 
drome sequences <SW, . . . ,5^, its computational complexity 
will be smaller than I times the complexity of the Berlekamp- 
Massey algorithm. 

If (8) does not have a unique solution, the polynomial A(x) 
may not be a valid error locator polynomial. To be able to 
detect this, we consider the following definition: 

Definition 3 (t-valid A-polynomial) 

A polynomial A(x) over ?„ is called t-valid if it is a polynomial 
of degree t and possesses exactly t distinct roots in ¥ q . 
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Algorithm 1: Efficient Error Location Algorithm 



n-h(k-l)-2 



input: =1 sf< \ ,h=l,...,l 
begin 

t <— 0, A(x) *— 1, m *— n — k 
rrih <— length (5^), AfW 

<- 0, d^ <- 1, tS h \x) <- 0, for = 1, . 
for eac/i 71/ /ram 1 fo m do 
for eac/i /i from 1 to I do 
if M — t> m— nih then 



,/ 



(ft) 

M—j—m-j-mh 



if d ^ then 

if M - 1 < Af - then 

LAt^AfxJ-^AC")^-^ 1 

else 

t <- t, A(x) <- A(x) 
A(^A(r)-^AW§^ w 
i <-M- (MW -iW) 

<_£ AP , )(i)*-A(j;) 
dC 1 ) <- d, M<» «- M 



end 

output: t, A(x) 



By definition of the error locator polynomial, a polynomial 
A(x) can only be a valid error locator polynomial, if it 
possesses exactly t distinct roots ¥ q . Hence, we accept a A- 
polynomial obtained from Algorithm 1 only if it complies with 
Definition 3. 

After we found a t-valid polynomial A(x), its t roots 
a _JI , a -32 , . . . , a _Jt indicate the positions 31,32, ■■■ ,jt of 
the erroneous symbols in the received word y(x). Error 
evaluation, i.e., determining the values of the error symbols 
can then be performed with respect to the original Reed- 
Solomon code C using standard methods like the Recursive 
Extension [10], or by adopting the Forney Algorithm [9]. 

Based on syndrome extension, Reed-Solomon decoding be- 
yond half the minimum distance can be performed by the fol- 
lowing steps: For a corrupted word y(x) of a Reed-Solomon 
code C = 1ZS (q;n, k) of rate R < 1/3, we first determine 
the largest integer I, which satisfies (11). Then, we create a 
matrix Y accordant to (6) by raising the received symbols 
Uo, yi, . . . , y n -i to the ith power for i = 1, . . . , I, We consider 
Y as received word of a heterogeneous IRS code and calculate 
a syndrome sequence for any row pursuant to (7). Then, we use 
Algorithm 1 to synthesize a pair t,A(x). If A(x) is a t-valid 
polynomial, then we find the roots a - -* 1 , oT^ , . . . , a~-' t , and 
evaluate the errors at the positions ji, ■ ■ ■ , jt by the Forney 
algorithm, by Syndrome Extension, or by any other standard 
method for error evaluation. The complete decoding procedure 
is summarized by Algorithm 2. Generally, depending on 
the received word y(x), Algorithm 2 may yield the following 
results: 



if t < 



1) The algorithm may yield the transmitted codeword, i.e, 

c(x) = c(x). 



Algorithm 2: Decoder Based on Syndrome Extension 
input: received word y(x) = tq + y±x + ■ ■ ■ + y n -\x r ' 
determine largest integer / satisfying (11) 
for each i from 1 to I do 
|_ compute (x) = y l a + y\x H h yl^iX 71 ' 1 

create matrix Y from y^> (x), i = 1, . . . , I 
use DFT to calculate syndromes <SM, . . . , <S^ 
compute t, A(x) by Algorithm 1 

and A(x) is t-valid then 
evaluate errors, and calculate e(x) 
calculate c(x) = y(x) — e(x) 

else 

decoding failure 
output: c(x) £ C or decoding failure 



2) The algorithm may decode a wrong codeword, i.e., 

c(x) ^ c(x). 

3) The algorithm may not yield a codeword at all, i.e, it 
may yield a decoding failure. 

These tree events occur with a certain probability. In the 
following, the probability or a correct decision is denoted by 
P c , the probability for a decoding error is denoted by P e , and 
the failure probability by Pf, Hence, the probability P w for 
an incorrect decoding result is obtained by P w = 1 — P c = 

Pe + Pf- 

Algorithm 2 possesses an interesting property, which is 
called Maximum Likelihood certificate property. The term 
ML certificate property was introduced in [26], where it is 
shown that a decoding algorithm based on linear programming 
exhibits this property. Formally, we define the ML certificate 
property as follows: 

Definition 4 (ML Certificate Property) 

Consider a code C, and assume that the word y is received 
when a codeword c G C is transmitted over a memoryless noisy 
channel. Moreover, consider a decoding algorithm which either 
decides on a unique codeword, or yields a decoding failure. 
We say that the decoding algorithm exhibits the ML certificate 
property, if it behaves in such a way that whenever it yields a 
codeword c 6 C, this codeword fulfills 

dist (y, c) = min {dist (y, c)} . 



In [20], a IRS decoding algorithm based on multi-sequence 
shift-register synthesis is proposed. Moreover, it is shown that 
this algorithm possesses the ML certificate property. After cre- 
ating the matrix Y from the received word y(x), Algorithm 2 
considers Y as received matrix of an IRS code and processes 
it exactly like the IRS decoding algorithm proposed in [20]. 
Thus, the correctness of the following theorem directly follows 
from the results proved in [20]. 

Theorem 1 Algorithm 2 possesses the ML certificate prop- 
erty. 



X 



VI. Error and Failure Probability 

The syndrome extension procedure described above in- 
creases the maximum error correcting radius by increasing the 
number of linear equations for calculating the error locator 
polynomial. In this case, (8) can have a unique solution 
for a larger number of unknowns t. However, it really only 
has a unique solution, if the additional equations obtained 
from the auxiliary syndromes are linearly independent, i.e., 
if rank (S) = t. Otherwise, a decoding failure is obtained. 
Hence, syndrome extension effectively increases the decoding 
performance only, if the failure probability Pf is fairly small. 
Consequently, to show that the proposed decoding method 
effectively works, we have to analyze its failure and error 
probability. 

A. Error Probability of Syndrome Extension 

It immediately follows from Theorem 1, that the error 
probability P e is not larger than the error probability of an 
ML decoder. Moreover, we are able to use the following result 
from [20]: 

Theorem 2 (Error Probability) Let C(q; n, k, d) be a linear 
block code of length n, dimension k, and minimum distance 
d over the field ¥ q , decoded by a BD decoder which exhibits 
the ML certificate property. Assume that the decoding radius 
of this decoder is < m ax, and that it decodes a received word, 
which is corrupted by t errors. Then, the probability for a 
decoding error is overbounded by 



Pe(t) < P e (t) 



El + ir^En {t ' Wx} -^(t,-,P) 



(15) 

where A w is the weight distribution of the code, that is, A w is 
the number of codewords of weight w in C, and U q (r2, ri, p) 
is calculated by 



U q (r 2 ,n,p) 



ri+r 2 -p 

E 

N-r{ 

7'2 - 1 



- (n + r 2 ) + 2i 
(q-2) p - {ri+r2)+2l {q-l) 



(16) 



Since Algorithm 2 possesses the ML certificate property, 
Theorem 2 can directly be applied for overbounding the error 
probability P e . 

B. Failure Probability 

Even though Y is considered as a received word of a 
heterogeneous IRS code, we cannot apply the upper bound 
on the failure probability derived in [20]. The reason for this 
lies in the fact, that for deriving the bound in [20], it is 
assumed that the erroneous columns in the received matrices 
are distributed uniformly over all non-zero vectors. Obviously 
this is not true for a heterogeneous IRS code obtained by 
syndrome extension, since all symbols in an erroneous column 
j are just powers of the error symbol ej. Consequently, 



the analysis of the failure probability of syndrome extension 
decoding is substantially more involved than the analysis of 
the failure probability for ordinary IRS decoding. However, for 
fields of characteristic 2 and for I = 2, i.e., for Reed-Solomon 
codes of rate l/6<-R<l/3 defined over the extension field 
¥2™, we are nevertheless able to derive a bound similar to 
the bound for ordinary IRS codes derived in [20]. This allows 
for an analytical evaluation of the decoding performance of 
Algorithm 2 for the most interesting low-rate Reed-Solomon 
codes. Later, we complement this analysis by experimental 
results obtained from Monte-Carlo simulations, which show 
that Algorithm 2 also works quite well, when it is applied to 
Reed-Solomon codes of rate smaller than 1/6, i.e., for I > 2. 

For a Reed-Solomon code C over F 2 ™ with a rate in the 
range l/6<i?<l/3, for which the largest integer satisfying 
(11) is I = 2, Algorithm 2 creates a matrix Y with two rows 
by squaring the coefficients of the received word y(x). Since 
the code C is defined over a field of characteristic 2, we have 
rf = (ai + ei) 2 = a 2 + e 2 , which means that Y can be written 
as 



Vo Vi 

2 2 

c ci 

r 2 r 2 



Vn-l 
V 2 



f'O 
,,2 



ei 



en-l 
e n-l 



(17) 

For a matrix Y in this form, we are able to derive an upper 
bound on the failure probability of Algorithm 2. To do this, we 
overbound the number of cases, in which the linear system of 
equations (8) has multiple solutions. Pursuant to the following 
lemma, which is a variant of Lemma 5 from [20], this yields 
an upper bound on the failure probability of Algorithm 2: 

Lemma 3 Consider a codeword c(x) of a Reed-Solomon 
code C, for which (11) is satisfied with I > 1. Assume that this 
word is corrupted by an error of weight t < < max and that 
Algorithm 2 yields a decoding failure. Then, the linear system 
of equations (8) with t unknowns has multiple solutions. 

Proof: Assume that a codeword c{x) £ C has been 
transmitted, and that the channel adds an error word e(x) 
with t non-zero elements at the positions ■ ■ ■ ,jt- Then 

Algorithm 2 creates a matrix Y with t erroneous columns. 
Algorithm 2 only yields a decoding failure, if Algorithm 1 
computes a pair t',A'(x), such that A'(x) is not t'-valid. 

First we show, that t' < t. We know, that the polynomial 
A(x) = Ili=i(l ~ a^x) is a t-valid polynomial, which 
represents a solution of the set of linear recursions (14). Since 
Algorithm 1 always yields the smallest length shift-register 
which is a solution of (14), we know that t' < t. 

Second, A'(x) is a solution of the set of linear recursions 
(14) of length t', and hence also for the system of equations 
(8) with t' unknowns. However, if A'(x) is a solution for the 
set of linear recursions (14) of length t', it is also a solution 
for length t > t', since if A' (a:) fulfills (14) for i > t', it 
obviously also fulfills (14) for i > t > t' . Thus, A'(x) is not 
only a solution of the system of equations (8) with t' unknowns 
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and the set of linear recursions (14) of length t', but also for 
the system (8) with t unknowns, and the set of recursions (14) 
of length t. Consequently, the system of equations (8) with t 
unknowns has at least the two solutions A(x) and A'(x). This 
proves the statement of Lemma 3. ■ 

Note, that from Lemma 3 we only know that (8) always 
has multiple solutions if Algorithm 2 fails. Contrariwise this 
does not mean that Algorithm 2 always fails if (8) has 
multiple solutions. Nevertheless, by counting the cases in 
which (8) multiple solutions, we obtain an upper bound on 
the probability Pf that Algorithm 2 fails. 

In the next step, we consider the following lemma, which 
allows us to utilize the structure of a matrix Y pursuant to 
(17): 

Lemma 4 Let (v, e) be a pair of two non-zero field elements. 
Then, the set 



V= LI? : v,eeW q \{0} 



forms the set of all full weight vectors of length 2, i.e., 
V={veF 2 q : wt(u) = 2} = [F,\{0}] 2 . 



Proof: Since there exist (q — l) 2 possibilities to select the 
pairs (v, e), it is sufficient to show that all (q— l) 2 vectors v G 
V are pairwise different. Indeed, by considering two vectors 
v,v G V, and assuming that v = v, we get 



ve , 
ve 2 . 



(18) 
(19) 



Dividing (19) by (18) yields e = e, and by inserting this into 
(18) we obtain v = v. Hence, two different pairs (v, e) ^ 
(e, v) always yield two different vectors v, v. ■ 
Next, we consider the following lemma, which is a variant 
of Lemma 6 from [20]: 

Lemma 5 Let , be two q-ary linear codes of length 
U, and let the dimension of W^ 1 -* be uj — and the 

dimension of be cu — g^ 2 \ Further, let 

W = ( f o\ 1 = (W Q , • • -,VJu-l) 



be a 2 x u> matrix such that wt (wj) = 2, Vj = 0, . . . , uj — 1, 
i.e., W does not contain any zero element. Moreover, assume 
that all columns of XV are uniformly distributed over all full 
weight vectors of length 2. Then, the probability P u that 



G 



is overbounded by 



L -(e (1) W 2) ) 

2uj * 



(9-1) 



(20) 



(21) 



Proof: Let C be the set of all 2 x to matrices whose rows 
fulfill (20). Further, let be the set of all 2 x uj matrices with 
elements from ¥ q , and let the subset <S^ C S u be the set of 



matrices without any non-zero elements. Then, the probability 
P^ for a matrix W without zero elements to fulfill (20) is 
overbounded by 

_ \cnsj\ |£j 
\s'J - \s' u \ • 

The cardinality \C\ is calculated by by 

\£\ = \W^\-\W^\^q^-( e(1>+ ^ 2> ) , 
and | S[j | is obtained by 

i5:i = («z-i) 2w . 

Dividing \C\ by |<Sj^| yields the right hand side of (21), which 
proves Lemma 5. ■ 
By utilizing Lemma 3, Lemma 4, and Lemma 5, we are 
now able to overbound the failure probability for codes over 
fields of characteristic two and for I = 2 by the following 
theorem: 

Theorem 3 (Failure Probability) Let C be a Reed-Solomon 
code over F 2 m, for which I = 2 is the largest integer satisfying 
(11). Assume that a codeword a(x) G C is corrupted by an 
error word e(x) with t non-zero coefficients, and that the 
resulting word r(x) — a(x) + e(x) is decoded by Algorithm 2. 
Then, the probability for a decoding failure is overbounded by 

Pf{t) < Pf(t) =[-^ + -) ■ ^— , (22) 



q 

q-i 



9-1 



where t v 



.[2] 



is calculated by Equation (9). 



Proof: According to Lemma 3, the failure probability of 
Algorithm 2 can be overbounded by investigating the linear 
system of equations (8). More precisely, we are interested in 
the cases in which (8) has multiple solutions, i.e., the cases 
in which rank (S) < t. We have such a case, if there exist a 
vector ii^O such that S ■ u = 0. Equivalently, we can say 
that (8) cannot have a unique solution, if 

3u^O:S I ' 1 -u = 0V! = l,2. (23) 

Since both syndrome matrices and are determined 
by the same error word e = (eo, e\, ■ ■ ■ , e„_i), we are able 
to express the failure probability Pf(t) by 

p M _ |{e : wt (e) = t and (23) holds} 
fW ~ \{e : wt(e = t)}| ' 

Now, we fix t positions ji, j'2, . . . ,jt, and define the en- 
semble St (ji, ■ ■ ■ , jt) of equiprobable vectors e of weight 
wt (e) = t that may take all possible non-zero values 
at the indices ji, J2, ■ ■ ■ , jt, and are zero in all remaining 
coordinates. The cardinality of this ensemble is calculated 
by \£t (ji, ■ ■ ■ , jt)\ = (q — !)*■ Moreover, we define the 
probability Pf(ji, ■ ■ ■ ,jt) that (8) is satisfied for vectors from 
the ensemble £ t (ji, . . . ,jt) by 



Pf{h,---,3t) = 



\{ee£t{h,h,---,3t) ■ (23) holds} I 



\£t (ji 



,jt) 



(24) 



10 



For overbounding Pf(ji, ■ ■ ■ ,jt), let the number of rows in 
S*- 1 ' be denoted by g^ = n — k — t, and the number of rows 
in be denoted by = n — 2k — t + 1. It is known 
(cf. e.g. [10]) that a syndrome matrices i = 1,2, can be 
decomposed into 

S [t\ = H M . p-W . £> . y t , 



where 



Vi = 



»2J2 



V 



a 



2jt 



v (t-l)j t 



is a i x t Vandermonde matrix, the matrices 



D = diag (a 31 , a j2 , . 



and 



diag (4, e 



are t x t diagonal matrices, and the matrix 



/ 



1 



v 2ii 



1 



a 



2j, 



is a x i matrix consisting of g^ rows of a transposed 
Vandermonde matrix. In other words, if' 1 ' can be considered 
as parity-check matrix of a (shortened) Reed-Solomon code 
of length t and dimension t — g^\ which we denote by 
W'*'. Since the matrices Vt and £) both have full rank, the 
product v = DVt ■ u defines a bijective linear map u i— > v. 
Consequently, the statement 

3v ^ : H®F® v = Vi = 1,2 (25) 

is equivalent to Equation (23). Due to this equivalence, we are 
able to calculate 2V& by 

7V fa = |{eef t (ji j t ) : (23) holds}| = 

= |{ee£ t (ji,...,j t ) : (25) holds}| • 

However, instead of directly counting the error vectors e 
satisfying (25), we consider all pairs (v, e) of vectors, which 
satisfy 

H [i] F [i\ . v = Vi = 1,2 . (26) 

In this way, we are able to overbound Nj, by 

N b < \{(v,e):e££ t (j 1 ,...,j t ),veW t q \{0}, (26) holds}| 

(27) 

This is a Union Bound, where error vectors e satisfying (25) 
may be counted several times with different vectors v. 

To improve (27), we take into account that if (26) holds for a 
pair (v, e), then it also holds for all pairs (av, e), a G F g \{0}. 
Hence, every vector e is counted in (27) at least q — 1 times, 
which means that we are able to improve (27) by the factor 
q — 1. In this way, we obtain 

N b <^ \{(v,e):ee£ t (ji, . ..,jt),(26) holds}) . 



Partitioning {d e \ {0}} into the sets V w . . . of 
vectors v with uj non-zero elements at the indices i\ , i 2i ■ ■ ■ , i u 
yields 



lo=1 (»!,...,»„,) «eV M (ii,---,iw) 

and by inserting (28) into (24), we obtain 



PfUl,---,jt) < 



< 



^E E E 



M{v) 



-1 (ti,. ..,!„) ueVuOti, ...,1a,) 



(29) 



Now, we reconsider the fact that if' 1 ' is a parity-check 

matrix of the code W' 1 ', and denote by = ■ v^j 

the vector obtained by multiplying v and F ^. This enables 
us to state another condition, which is equivalent to (26): 



,W e wW V* = l,2. 



(30) 



Consequently, instead of a pair (u,e), we equivalently may 
consider the matrix 



W 



[2] 



vie h 

2 



vie 



v 2 e h 

= 2 



31 



v 2 e 



32 



v t e jt 
vtel 



and write H^'W = instead of (26). For a vector v with 
non-zero elements at the indices i\, i 2 , . . . , i w , the correspond- 
ing matrix W possesses exactly u> non-zero columns, from 
which we create a2xw matrix W w . Moreover, by creating 
the 2 x uj matrix H$ from the columns i\, i 2 , . ■ ■ , i u of H^, 
the statements H^W T = and 



(31) 



are equivalent. Now, we show that the total number | \ of 
matrices satisfying (31), can be calculated by 



P,„ ■ \Wu 



Puj ■ |V W (h, ■ . ■ \S t (ji, . . . ,j t ) 



(32) 



To do this, we consider the set of all matrices satisfying (31), 
which are obtainable from all vectors v G V w (ii, . . . 
and e G £ t (ji, . . . , j t ). We observe that the elements Vj and 
ej i in these matrices may independently take all values from 
¥ q \ {0}. Hence, by applying Lemma 4, we conclude that the 
columns of all matrices are independent random vectors 
from [F 9 \ {0}] 2 . This in turn enables us to apply Lemma 5 for 
overbounding the probability P^ for a matrix W ' u to satisfy 
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(31). Finally, by combining (29) and (32) we obtain 



tE 



■ 9 



-(e (1) + e <2) ) 



g -(^)+.W) « / f 
9- 1 ^> 



9 



< 



9-1 

9 

9 



9-1 
9 

9-1 



E 

t 

9 



9-1 

2 



< 



9 



9-1 

(2n-3fe-2i+l) 



1 



1\* „-3-(W*-t) 



9-1 



Since the right side of (33) does not depend on the selection 
of ji, j2, . . • , jt, but only on weight t of the error vector e, it 
also provides us with an upper bound on Pf(t). This proves 
Theorem 3. ■ 
Since the factor 7 is close to one, the failure probability 
Pf{t max ) is in the order of l/(q — 1) for the maximum 
error correcting radius £ max - Moreover we see, that the failure 
probability decreases exponentially with decreasing t. Hence, 
(22) behaves rather similar to the upper bound on the failure 
probability for ordinary IRS codes. The only difference is, that 
for an IRS code obtained by syndrome extension, the factor 7 
is slightly larger. 

C. Probability P w in the QSC 

If we use Algorithm 2 for decoding a Reed-Solomon 
codeword which is corrupted by t < r = [^ir^J errors ' 
the decoding result is always unique and we are always able 
to correct the errors. Hence, the probability for an incorrect 
decoding result is P w = in this case. If the number of 
errors is in the range t < t < t ma J- l \ Algorithm 2 yields a 
decoding failure with probability Pf > 0, and a decoding error 
with probability P e > 0. However, since the probability for 
an incorrect decoding result P w = P e + Pf is generally much 
smaller than one, we are still able to correct the errors with the 
probability P c > 0. If t > f max ''', Algorithm 2 either yields a 
decoding failure, or an erroneous decoding result. Hence, we 
have the probability P w = P e + Pf = 1 in this case. Based 
on these facts, the probability P w for an incorrect decoding 
result in the QSC with crossover probability p is calculated in 
the usual way by 



P„ 



E 

t=T+l 



(P e (t) + P f (t)) ■ p* • (1 - pY 



Clearly, for Reed-Solomon codes with rate l/6<i£<l/3 
we are able to overbound P w by replacing P e (t) and Pf (t) by 
their upper bounds (15) and (22) respectively. More precisely, 
we replace P w {i) by 



P w {t) 



{P e (t) + P f (t),l} 



(34) 



where P e (t) is computed by means of Theorem 2 and P f (t) 
is calculated by means of Theorem 3. In this way, we obtain 
the following corollary: 

Corollary 4 Let C be a Reed-Solomon code over ¥2^ for 
which I = 2 is the largest integer satisfying (11). Assume 
that the codewords of this codes are transmitted over a QSC 
with crossover probability p, and that the received words are 
decoded by Algorithm 2. Then, the probability P\^ 2 to obtain 
an incorrect decoding result is overbounded by 



pl=2 < p'"2 



E 

t=r+l 



p^-pt-ii-py 



(35) 



where P w (t) is given by (34). 



(33) 

D. Error Probability versus Failure Probability 



Comparing the upper bound (15) on the error probability P e 
and the upper bound (22) on the failure probability Pf yields 
that the error probability is generally much smaller than the 
failure probability. To demonstrate this, we consider the Reed- 
Solomon code 1ZS (2 8 , 255, 63). By classical BMD decoding 
we are able to correctly decode all error patterns with t < 96 
non-zero symbols. Moreover, Algorithm 2 is able to decode 
error patterns in up to < max = 107. Calculating the upper 
bounds (15) and (22) for the interval 97 < t < 107 yields the 
results depicted in Fig. 3. We observe that the error probability 
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Fig. 3. Upper bounds on the failure and error probability for a Reed-Solomon 
code 7^5 (2 8 ; 255, 63) decoded by Algorithm 2. 



P e is smaller by more than 100 orders of magnitude than the 
failure probability Pf. This means, that from a practical point 
of view the error probability P e is absolutely negligible for the 
decoding performance of Algorithm 2. The probability P w for 
an incorrect decoding result is absolutely dominated by the 
failure probability Pf. Consequently, Theorem 3 should be 
considered as one of the most important results in this paper, 
since it theoretically proves the effectiveness of syndrome 
extension decoding. 
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E. Experimental Results 

To verify the tightness of the bound stated by Theorem 3, 
we would like to compare it with the actual failure probability 
obtained by Algorithm 2. In principle, we are able to obtain 
the actual failure probability by Monte-Carlo simulations. 
However, since Pf decreases with the field cardinality |F g |, 
reliable results can only be obtained for Reed-Solomon codes 
over fields with small cardinalities. Therefore we considered 
Reed-Solomon codes over the field F 2 s for our experimental 
investigations. 

Pursuant to the equations (10) and (13), the Reed-Solomon 
code TZS (2 5 ,31,6) can be decoded by Algorithm 2 using I = 
2 syndromes up to the maximum decoding radius < max ' 2 ' = 
15. In contrast to this, a classical BMD decoder is able to 
correct up to 12 errors. Thus, we performed experiments in 
which we created N = 10 8 random error vectors of weight t 
for each t in the interval 12 < t < 15, and counted the number 
Nf of decoding failures. The results of these experiments are 
presented in Table I. 

TABLE I 

Experimental results on the failure probability Pf (t) for the 
Reed-Solomon code TZS (31, 6) 





TZS(2 b 


;3i,6) 




1 


2 


t [2J 

'max 


15 


i 


13 


14 


15 


N 


10* 


10 s 


10 8 


N f 





88 


3025500 


Pf 


<io~ 8 


Ri 9 ■ 10"' 


pa 3 ■ 10-* 


P f (22) 


6.7 ■ 10" 11 


2.3 ■ 10~ G 


8.1 • 10~ 2 



From Table I we observe, that Theorem 3 gives a quite good 
estimate for the actual behavior of the failure probability for 
Reed-Solomon codes over fields of characteristic 2 decoded 
by Algorithm 2 with I = 2, i.e., for Reed-Solomon codes of 
rate 1/6 < R < 1/3. We see that Pf(t) actually decreases 
exponentially with decreasing t like predicted by Theorem 3. 

To investigate whether we have a similar behavior for I > 3, 
we consider the Reed-Solomon code TZS (2 ; 31, 4J, which 
can be decoded by a BMD decoder up to the radius 14, and 
by Algorithm 2 with I = 3 up to the radius i max ' 3 ' = 18. We 
again performed N = 10 8 random experiments for each t in 
the range 14 < t < 18. Table II shows the results of these 
experiments. 

TABLE II 

Experimental results on the failure probability Pf (t) for the 
Reed-Solomon code TZS (31, 4) 







TZS (2 


b ;31,4) 
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t [3J 


18 
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15 


16 


17 


18 
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10" 


10" 


10« 


10" 


Nf 








37 


3121501 


Pf 


<icr 8 




« « 4 -10-'' 


3 • 



Form Table II we see the basic behavior of Pf (t) for I > 2 
is similar to the case I = 2, even if we are not able to derive 
an upper bound for this case. Hence, from a practical point 
of view, we can say that our syndrome extension procedure 
can effectively be applied for all Reed-Solomon codes of rate 
R % 1/3- 

VII. Decoding Performance 

As already mentioned, the threshold rates and the error 
correcting radii of Algorithm 2 are similar to the classical 
Sudan algorithm [3]. However, since the decoding strategy 
in the case of an ambiguous decoding result is different, the 
question is how our syndrome extension approach actually 
performs in comparison to the Sudan algorithm. To study 
this, and to give a vivid expression of what can be gained 
by decoding errors beyond half the minimum distance, we 
complement this paper by Monte-Carlo simulations on the 
q-ary symmetric channel (QSC), in which we compare the 
decoding performance of classical BMD decoding, the original 
Sudan algorithm [3] and Algorithm 2. 

At first, we consider the code TZS (2 8 ; 255, 63). A BMD 
decoder can correct up to 96 errors with this code. In contrast 
to this, Algorithm 2 is able to decode error patterns up 
to t max = 107. The same holds for the original Sudan 
algorithm [3]. Fig. 4 shows the QSC decoding performance 
for a BMD decoder, for Algorithm 2 and for the Sudan 
algorithm. If a transmitted symbol is received correctly with 
probability 1 — p = 0.7, we observe that the word error 
rate obtained by Algorithm 2 is more than 100 times smaller 
than the word error rate obtainable by BMD decoding. The 
simulation results for the Sudan algorithm coincide very well 
with the simulation results obtained by Algorithm 2. Hence, 
our algorithm basically yields the same decoding performance 
as the classical Sudan algorithm. Moreover, we see that the 
bound calculated by (35) is only slightly worse than the actual 
word error performance of Algorithm 2, which means that 
Corollary 4 is well suited to get reliable estimates for the 
decoding performance. 

Next, we consider the code TZS (2 8 ; 255, 38), whose rate 
is approximately 0.15 and pursuant to (13) we have I = 3. 
For this case, we do not have an analytical bound on P w . 
However, the simulation results depicted in Fig. 5 show that 
also for this case, the decoding performance of Algorithm 2 
is basically identical to to the decoding performance of the 
Sudan algorithm. Hence, from a practical point of view, our 
decoding algorithm can compete well with the original Sudan 
algorithm [3]. Moreover, if the crossover probability of the 
QSC is approximately p = 0.4, that is, if the transmitted 
symbols are received correctly with probability 1 — p = 0.6, 
Algorithm 2 achieves a word error rate which is more than 
10000 times smaller than the word error rate obtainable by 
classical BMD decoding. 

VIII. Conclusions 

In this paper, we have considered a syndrome based method 
for decoding low-rate Reed-Solomon codes beyond half the 
minimum distance. This is achieved by virtually extending the 
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Fig. 4. Comparison of the QSC decoding performance of the Reed-Solomon 
code 1ZS (2 s ; 255, 63) for BMD decoding, Algorithm 2, and the Sudan 
Algorithm. 
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Fig. 5. Comparison of the QSC decoding performance of the Reed-Solomon 
code 7^5 (2 s ; 255, 38) for BMD decoding, Algorithm 2, and the Sudan 
Algorithm. 



received Reed-Solomon words into words of an heterogeneous 
IRS code. In this way, we are able to use an efficient IRS 
decoding method described in [20], which is based on multi- 
sequence shift-register synthesis, which can be performed in 
the same order of complexity as the well-known Berlekamp- 
Massey algorithm. 

Most results derived in [20] for heterogeneous IRS codes are 
also valid for the "virtual" IRS codes obtained by syndrome 
extension. However, the result on the failure probability P w (t) 
stated in [20] cannot be applied to analyze syndrome extension 
decoding, since the erroneous columns cannot be considered 
to be uniformly distributed over all non-zero columns. 

Nevertheless, even if the theoretical analysis of P w (t) 
is much more involved in the case of syndrome extension 
decoding, we have been able to derive and prove an upper 
bound on the failure probability for codes of rate 1/6 < 
R < 1/3 defined over fields of characteristic two. This 
bound theoretically proves the effectiveness of the decoding 
procedure described by Algorithm 2. To check the tightness 
of this bound, we performed Monte-Carlo experiments for 
codes defined over the extension field F 2 5, and showed that 
our bound coincides quite well with the experimental results. 

Moreover, to compare our decoder with the original Sudan 
algorithm [3], and to get an impression of gains which can 
be achieved by our decoder, we performed Monte-Carlo 
simulations for the g-ary symmetric channel. We observed, 
that the performance obtained by Algorithm 2 is practically 
identical to the performance of the Sudan algorithm. 

The decoding method presented in this paper can be imple- 
mented quite efficiently and can therefore be an interesting 
alternative to the Sudan algorithm. Unfortunately, like the 
original Sudan algorithm, its application is limited to codes 
of rate R < 1/3. For the Sudan algorithm there exists an 



extension by Guruswami and Sudan [11] which overcomes 
this problem. So, its remains a major challenge to extend the 
approach presented here to be able to decode Reed-Solomon 
codes of higher rates. Moreover, it would be desirable to find 
an universal bound on P w (t), which allows it to analytically 
evaluate the performance of Algorithm 2 for arbitrary Reed- 
Solomon codes. 
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